Rank in Wordlist | Frequency | Word |
---|---|---|
126753 | 25 | 100,000 |
133325 | 23 | 10,000 |
135911 | 23 | مناسبت,سیاسی |
205146 | 12 | وي,با |
235057 | 9 | است,و |
249572 | 8 | 200,000 |
249824 | 8 | A,B,C |
260691 | 8 | صبر,سحر |
270408 | 7 | 40,000 |
270485 | 7 | 7,500 |
Rank in Wordlist | Frequency | Word |
---|---|---|
3896 | 5079 | امام(ره |
5547 | 3203 | رضا(ع |
6239 | 2750 | خميني(ره |
6979 | 2366 | حسين(ع |
7326 | 2211 | پيامبر(ص |
7432 | 2169 | علي(ع |
11705 | 1137 | زهرا(س |
12602 | 1018 | صادق(ع |
12746 | 1000 | علی(ع |
13989 | 870 | خمینی(ره |
Rank in Wordlist | Frequency | Word |
---|---|---|
14835 | 798 | ايسنا)، |
23271 | 401 | ع)، |
29381 | 275 | امام(ره)، |
31633 | 246 | ره)، |
34136 | 217 | است)، |
35835 | 201 | خميني(ره)، |
39168 | 175 | پيامبر(ص)، |
40730 | 164 | رضا(ع)، |
42495 | 153 | ص)، |
45277 | 138 | حسين(ع)، |
Rank in Wordlist | Frequency | Word |
---|---|---|
28983 | 281 | 100% |
40187 | 167 | 50% |
52750 | 107 | 80% |
53669 | 104 | 90% |
56365 | 96 | 10% |
62432 | 81 | 20% |
64970 | 76 | 70% |
74841 | 60 | 30% |
78074 | 56 | 25% |
83540 | 50 | 40% |
Rank in Wordlist | Frequency | Word |
---|---|---|
52443 | 108 | AT&T |
63430 | 79 | R&D |
249831 | 8 | AT&T، |
270683 | 7 | Drag&Drop |
270959 | 7 | S&P |
296815 | 6 | R&B |
300654 | 6 | اس&جی |
375790 | 4 | 4&5 |
376323 | 4 | A&M |
376454 | 4 | B&N |
Rank in Wordlist | Frequency | Word |
---|---|---|
249315 | 8 | $$ |
333693 | 5 | Ðe$igNER |
374627 | 4 | $$$ |
375248 | 4 | 200$ |
375818 | 4 | 40000$ |
442413 | 3 | $) |
442414 | 3 | $. |
442415 | 3 | $A |
444583 | 3 | 5000$ |
558030 | 2 | $$$$ |
Rank in Wordlist | Frequency | Word |
---|---|---|
821026 | 1 | $$" |
821027 | 1 | $$$" |
821082 | 1 | %" |
Rank in Wordlist | Frequency | Word |
---|---|---|
103699 | 35 | Children's |
165614 | 16 | I'm |
166349 | 16 | است'، |
191949 | 13 | شعار'جهاد |
207000 | 11 | Don't |
208610 | 11 | اقتصادي'، |
250036 | 8 | It's |
270817 | 7 | L'art |
271149 | 7 | it's |
296324 | 6 | Assassin's |
Rank in Wordlist | Frequency | Word |
---|---|---|
13003 | 971 | 5/1 |
14900 | 792 | 5/2 |
20413 | 493 | 5/3 |
29373 | 275 | 5/4 |
31517 | 248 | ۵/۱ |
34327 | 215 | 5/5 |
35687 | 202 | 2/1 |
36343 | 196 | 5/7 |
36458 | 196 | ۵/۲ |
39041 | 175 | 5/6 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots